MRC: Multi Relational Clustering approach
نویسندگان
چکیده
— Clustering is a process of partitioning data objects into groups based on the similarity measures. Most of the existing methods perform clustering within a single table, but most of the real-world databases, however, store information in multiple tables. We propose a new method which is called Multi Relational Clustering (MRC) for clustering a relational database. The MRC approach uses existing clustering algorithms for clustering every table of database. Tables in a database are related to each other based on foreign keys. The MRC approach divides the tables into two categories: dependent and independent tables. A dependent table is a table that includes entities attributes, as well as fields related to the other entities which belong to the other tables. In fact a dependent table includes one or more foreign keys. The MRC approach firstly, clusters independent tables then utilizes these results for clustering dependent tables. The MRC clusters each table by existing clustering algorithm with respect to its fields. An important feature of the MRC approach is ability of clustering several tables in parallel. The proposed approach is very simple and is developed under SQL very efficiently. We offer a version of implementation of k-Means in SQL and use it for clustering a database by MRC approach. Our experiments show that the MRC is efficient for clustering a huge database in a relational environment.
منابع مشابه
Multi-objective optimization in WEDM of D3 tool steel using integrated approach of Taguchi method & Grey relational analysis
In this paper, wire electrical discharge machining of D3 tool steel is studied. Influence of pulse-on time, pulse-off time, peak current and wire speed are investigated for MRR, dimensional deviation, gap current and machining time, during intricate machining of D3 tool steel. Taguchi method is used for single characteristics optimization and to optimize all four process parameters simultaneous...
متن کاملA Hybrid Grey based Two Steps Clustering and Firefly Algorithm for Portfolio Selection
Considering the concept of clustering, the main idea of the present study is based on the fact that all stocks for choosing and ranking will not be necessarily in one cluster. Taking the mentioned point into account, this study aims at offering a new methodology for making decisions concerning the formation of a portfolio of stocks in the stock market. To meet this end, Multiple-Criteria Decisi...
متن کاملMulti-type Relational Clustering Approaches: Current State-of-the-Art and New Directions
The proliferation of multi-type relational datasets in a number of important real-world applications and the limitations resulting from the transformation of such datasets to fit propositional data mining approaches have led to the emergence of the discipline of multi-type relational data mining. Clustering is an important unsupervised learning task aimed at discovering structure inherent in da...
متن کاملResearch on Multi-relational Clustering Problems Based on Proper-link
A relational database contains a wealth of information, and multi-relational clustering can be obtained by adopting proper-link. The similarities between prospers and linked calculation target topples can be worked out through searching for relevant multi-relational prospers and links of user clustering target, then the clustering can be completed by selecting CLARANS. In order to promote the e...
متن کاملClustering Approach to Generalized Pattern Identification Based on Multi-instanced Objects with DARA
Clustering is an essential data mining task with various types of applications. Traditional clustering algorithms are based on a vector space model representation. A relational database system often contains multirelational information spread across multiple relations (tables). In order to cluster such data, one would require to restrict the analysis to a single representation, or to construct ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009